Recording apparatus, recording method, and record medium

ABSTRACT

A recording apparatus for recording video data to a record medium is disclosed, that comprises an encoding means for encoding video data in a group structure of a plurality of frames corresponding to a compression-encoding process in a combination of an inter-frame predictive encoding process and a motion compensative process, a transforming means for transforming the data structure of encoded video data that is output from the encoding means into a file structure that can be processed by a computer software program without a dedicated hardware portion so that moving pictures and so forth are synchronously reproduced, and a recording means for recording data having the file structure to a record medium, wherein the file structure has a first data unit and a second data unit, the second data unit being a set of the first data units, and wherein at least one data structure is matched with the first data unit.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a recording apparatus, a recordingmethod, and a record medium that allow accessibility and editingefficiency to be improved.

2. Description of the Related Art

In recent years, as a multi-media system software program, QuickTime isknown. The QuickTime is a software program that allows data that varieson time base (this data is referred to as movie) to be handled. A moviecontains a moving picture, a voice, and a text. Currently, a QuickTimefile format is available as a Macintosh platform of Apple. The QuickTimefile format is an MPEG-1 (Moving Picture Experts Group phase 1) programstream file storage format of which a video elementary stream and anaudio elementary stream are multiplexed on time base). In the storageformat, the entire MPEG-1 file (namely, one whole closed scene) istreated as a sample of the QuickTime file format regardless of theduration thereof. Such a large sample is treated as one large chunk.

In addition, audio data and video data are stored together on one track(or one medium) in the QuickTime file format. As a new medium type thatrepresents such data portions contained in a large sample or a largechunk, MPEG Media has been defined.

However, the accessibility and editing efficiency of a particular typeof data contained in a large sample deteriorate. To allow a computer toreproduce and edit a QuickTime movie file, video data and audio datarecorded on a record medium (for example, an optical disc) of theportable recording and reproducing apparatus with built-in camera may beconverted into a QuickTime file format. In this case, the accessibilityand editing efficiency of a particular type of data should be furtherimproved. This problem applies to an audio data recording andreproducing apparatus as well as such a video data recording andreproducing apparatus.

OBJECTS AND SUMMARY OF THE INVENTION

Therefore, an object of the present invention is to provide a recordingapparatus, a recording method, and a record medium that allow theaccessibility and editing efficiency to be improved in the case thatdata having a file structure corresponding to a multimedia data formatsuch as QuickTime is recorded to a record medium.

A first aspect of the present invention is a recording apparatus forrecording video data to a record medium, comprising an encoding meansfor encoding video data in a group structure of a plurality of framescorresponding to a compression-encoding process in a combination of aninter-frame predictive encoding process and a motion compensativeprocess, a transforming means for transforming the data structure ofencoded video data that is output from the encoding means into a filestructure that can be processed by a computer software program without adedicated hardware portion so that moving pictures and so forth aresynchronously reproduced, and a recording means for recording datahaving the file structure to a record medium, wherein the file structurehas a first data unit and a second data unit, the second data unit beinga set of the first data units, and wherein at least one data structureis matched with the first data unit.

A second aspect of the present invention is a recording apparatus forrecording video data to a rewritable optical disc, comprising anencoding means for encoding video data corresponding to acompression-encoding process, a transforming means for transforming thedata structure of encoded video data that is output from the encodingmeans into a file structure that can be processed by a computer softwareprogram without a dedicated hardware portion so that moving pictures andso forth are synchronously reproduced, and a recording means forrecording data having the file structure to an optical disc, wherein thefile structure has a first data unit and a second data unit, the seconddata unit being a set of the first data units, and wherein the seconddata unit is matched with a successive record length of data written tothe optical disc.

A third aspect of the present invention is a recording apparatus forrecording audio data to a rewritable optical disc, comprising atransforming means for transforming the data structure of audio data orencoded audio data into a file structure that can be processed by acomputer software program without a dedicated hardware portion so thatmoving pictures and so forth are synchronously reproduced, and arecording means for recording data having the file structure to theoptical disc, wherein the file structure has a first data unit and asecond data unit, the second data unit being a set of the first dataunits, and wherein the second data unit is matched with a successiverecord length of data written to the optical disc.

A fourth aspect of the present invention is a recording apparatus forrecording video data and audio data to a record medium, comprising avideo encoding means for encoding video data in a group structure of aplurality of frames corresponding to a compression-encoding process in acombination of an inter-frame predictive encoding process and a motioncompensative process, an audio output means for outputtingcompression-encoded or non-compressed audio data, a means fortransforming the data structure of encoded video data that is outputfrom the video encoding means and audio data that is output from theaudio output means into a file structure that can be processed by acomputer software program without a dedicated hardware portion so thatmoving pictures and so forth are synchronously reproduced andmultiplexing the encoded video data and the audio data having the filestructure, and a recording means for recording multiplexed data havingthe file structure to a record medium, wherein the file structure has afirst data unit and a second data unit, the second data unit being a setof the first data units, and wherein at least one data structure of theencoded video data is matched with the first data unit.

A fifth aspect of the present invention is a recording apparatus forrecording video data and audio data to a rewritable optical disc,comprising a video encoding means for encoding video data in a groupstructure of a plurality of frames corresponding to acompression-encoding process in a combination of an inter-framepredictive encoding process and a motion compensative process, an audiooutput means for outputting compression-encoded or non-compressed audiodata, a means for transforming the data structure of encoded video datathat is output from the video encoding means and audio data that isoutput from the audio output means into a file structure that can beprocessed by a computer software program without a dedicated hardwareportion so that moving pictures and so forth are synchronouslyreproduced and multiplexing the encoded video data and the audio datahaving the file structure, and a recording means for recordingmultiplexed data having the file structure to an optical disc, whereinthe file structure has a first data unit and a second data unit, thesecond data unit being a set of the first data units, and wherein thesecond data unit is matched with a successive record length of whichdata is successively written to the optical disc.

A sixth aspect of the present invention is a recording method forrecording video data to a record medium, comprising the steps ofencoding video data in a group structure of a plurality of framescorresponding to a compression-encoding process in a combination of aninter-frame predictive encoding process and a motion compensativeprocess, transforming the data structure of encoded video data into afile structure that can be processed by a computer software programwithout a dedicated hardware portion so that moving pictures and soforth are synchronously reproduced, and recording data having the filestructure to a record medium, wherein the file structure has a firstdata unit and a second data unit, the second data unit being a set ofthe first data units, and wherein at least one data structure is matchedwith the first data unit.

A seventh aspect of the present invention is a recording method forrecording video data to a rewritable optical disc, comprising the stepsof encoding video data corresponding to a compression-encoding process,transforming the data structure of encoded video data into a filestructure that can be processed by a computer software program without adedicated hardware portion so that moving pictures and so forth aresynchronously reproduced, and recording data having the file structureto an optical disc, wherein the file structure has a first data unit anda second data unit, the second data unit being a set of the first dataunits, and wherein the second data unit is matched with a successiverecord length of data written to the optical disc.

An eighth aspect of the present invention is a recording method forrecording audio data to a rewritable optical disc, comprising the stepsof transforming the data structure of audio data or encoded audio datainto a file structure that can be processed by a computer softwareprogram without a dedicated hardware portion so that moving pictures andso forth are synchronously reproduced, and recording data having thefile structure to the optical disc, wherein the file structure has afirst data unit and a second data unit, the second data unit being a setof the first data units, and wherein the second data unit is matchedwith a successive record length of data written to the optical disc.

A ninth aspect of the present invention is a recording method forrecording video data and audio data to a record medium, comprising thesteps of encoding video data in a group structure of a plurality offrames corresponding to a compression-encoding process in a combinationof an inter-frame predictive encoding process and a motion compensativeprocess, outputting compression-encoded or non-compressed audio data,transforming the data structure of encoded video data and audio datainto a file structure that can be processed by a computer softwareprogram without a dedicated hardware portion so that moving pictures andso forth are synchronously reproduced and multiplexing the encoded videodata and the audio data having the file structure, and recordingmultiplexed data having the file structure to a record medium, whereinthe file structure has a first data unit and a second data unit, thesecond data unit being a set of the first data units, and wherein atleast one data structure of the encoded video data is matched with thefirst data unit.

A tenth aspect of the present invention is a recording method forrecording video data and audio data to a rewritable optical disc,comprising the steps of encoding video data in a group structure of aplurality of frames corresponding to a compression-encoding process in acombination of an inter-frame predictive encoding process and a motioncompensative process, outputting compression-encoded or non-compressedaudio data, transforming the data structure of encoded video data andaudio data into a file structure that can be processed by a computersoftware program without a dedicated hardware portion so that movingpictures and so forth are synchronously reproduced and multiplexing theencoded video data and the audio data having the file structure, andrecording multiplexed data having the file structure to an optical disc,wherein the file structure has a first data unit and a second data unit,the second data unit being a set of the first data units, and whereinthe second data unit is matched with a successive record length of whichdata is successively written to the optical disc.

An eleventh aspect of the present invention is a record medium on whicha program for recording video data to a record medium has been recorded,the program causing a computer to perform the steps of encoding videodata in a group structure of a plurality of frames corresponding to acompression-encoding process in a combination of an inter-framepredictive encoding process and a motion compensative process,transforming the data structure of encoded video data into a filestructure that can be processed by a computer software program without adedicated hardware portion so that moving pictures and so forth aresynchronously reproduced, and recording data having the file structureto a record medium, wherein the file structure has a first data unit anda second data unit, the second data unit being a set of the first dataunits, and wherein at least one data structure is matched with the firstdata unit.

A twelfth aspect of the present invention is a record medium on which aprogram for recording video data to a rewritable optical disc has beenrecorded, the program causing a computer to perform the steps ofencoding video data corresponding to a compression-encoding process,transforming the data structure of encoded video data into a filestructure that can be processed by a computer software program without adedicated hardware portion so that moving pictures and so forth aresynchronously reproduced, and recording data having the file structureto an optical disc, wherein the file structure has a first data unit anda second data unit, the second data unit being a set of the first dataunits, and wherein the second data unit is matched with a successiverecord length of data written to the optical disc.

A thirteenth aspect of the present invention is a record medium on whicha program for recording audio data to a rewritable optical disc has beenrecorded, the program causing a computer to perform the steps oftransforming the data structure of audio data or encoded audio data intoa file structure that can be processed by a computer software programwithout a dedicated hardware portion so that moving pictures and soforth are synchronously reproduced, and recording data having the filestructure to the optical disc, wherein the file structure has a firstdata unit and a second data unit, the second data unit being a set ofthe first data units, and wherein the second data unit is matched with asuccessive record length of data written to the optical disc.

A fourteenth aspect of the present invention is a record medium on whicha program for recording video data and audio data to a record medium hasbeen recorded, the program causing a computer to perform the steps ofencoding video data in a group structure of a plurality of framescorresponding to a compression-encoding process in a combination of aninter-frame predictive encoding process and a motion compensativeprocess, outputting compression-encoded or non-compressed audio data,transforming the data structure of encoded video data and audio datainto a file structure that can be processed by a computer softwareprogram without a dedicated hardware portion so that moving pictures andso forth are synchronously reproduced and multiplexing the encoded videodata and the audio data having the file structure, and recordingmultiplexed data having the file structure to a record medium, whereinthe file structure has a first data unit and a second data unit, thesecond data unit being a set of the first data units, and wherein atleast one data structure of the encoded video data is matched with thefirst data unit.

A fifteenth aspect of the present invention is a record medium on whicha program for recording video data and audio data to a rewritableoptical disc has been recorded, the program causing a computer toperform the steps of encoding video data in a group structure of aplurality of frames corresponding to a compression-encoding process in acombination of an inter-frame predictive encoding process and a motioncompensative process, outputting compression-encoded or non-compressedaudio data, transforming the data structure of encoded video data andaudio data into a file structure that can be processed by a computersoftware program without a dedicated hardware portion so that movingpictures and so forth are synchronously reproduced and multiplexing theencoded video data and the audio data having the file structure, andrecording multiplexed data having the file structure to an optical disc,wherein the file structure has a first data unit and a second data unit,the second data unit being a set of the first data units, and whereinthe second data unit is matched with a successive record length of whichdata is successively written to the optical disc.

According to the present invention, since at least one GOP in MPEGcompression video corresponds to a data unit of such as a QuickTimefile, each of plurality of types of data can be accessed and edited.When a data having a file structure is recorded to an optical disc,since the successive record length corresponds to a second data unit(for example, a chunk of QuickTime), the accessibility and editingefficiency can be improved.

The applicant of the present invention has the following US patents asprior patents thereof.

(1) U.S. Pat. No. 4,945,475

(2) U.S. Pat. No. 5,253,053

(3) U.S. Pat. No. 5,652,879

These and other objects, features and advantages of the presentinvention will become more apparent in light of the following detaileddescription of a best mode embodiment thereof, as illustrated in theaccompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram showing the structure of an embodiment of thepresent invention;

FIG. 2 is a schematic diagram showing an example of a QuickTime fileformat according to the present invention;

FIGS. 3A and 3B are schematic diagrams for explaining the relationbetween GOPs and a QuickTime file format according to the embodiment ofthe present invention;

FIGS. 4A and 4B are schematic diagrams for explaining the relationbetween a compression-encoded audio and a QuickTime file formataccording to the embodiment of the present invention;

FIGS. 5A and 5B are schematic diagrams for explaining another example ofthe relation between a compression-encoded audio and a QuickTime fileformat according to the embodiment of the present invention;

FIGS. 6A, 6B, 6C, and 6D are schematic diagrams for explaining therelation between GOPs of an MPEG video and a QuickTime file formataccording to the embodiment of the present invention;

FIGS. 7A, 7B, 7C, and 7D are schematic diagrams for explaining anexample of the relation between a compression-encoded audio and aQuickTime file format according to the embodiment of the presentinvention;

FIG. 8 is a schematic diagram for explaining an example of a recordingmethod for an optical disc according to the embodiment of the presentinvention; and

FIG. 9 is a schematic diagram for explaining another example of arecording method for an optical disc according to the embodiment of thepresent invention.

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

Next, with reference to the accompanying drawings, an embodiment of thepresent invention will be described. FIG. 1 shows an example of thestructure of a digital recording and reproducing apparatus according tothe embodiment of the present invention. An input video signal issupplied to a video encoder 1 shown in FIG. 1. The video encoder 1compression-encodes the video signal. In addition, an input audio signalis supplied to an audio encoder 2. The audio encoder 2compression-encodes the audio signal. The compression-encoding methodapplied for the video signal and the audio signal is for example MPEG.Output signals of the video encoder 1 and the audio encoder 2 arereferred to as elementary streams.

When MPEG is used, the video encoder 1 is composed of a motionpredicting portion, a picture sequence rearranging portion, asubtracting portion, a DCT portion, a quantizing portion, a variablelength code encoding portion, and a buffer memory. The motion predictingportion detects a moving vector. The subtracting portion forms apredictive error between an input picture signal and a locally decodedpicture signal. The DCT portion transforms an output signal of thesubtracting portion corresponding to the DCT method. The quantizingportion quantizes an output signal of the DCT portion. The variablelength encoding portion encodes an output signal of the quantizingportion into a signal having a variable length. The buffer memoryoutputs the encoded data at a constant data rate. The picture sequencerearranging portion rearranges the sequence of pictures corresponding tothe encoding process. In other words, the picture sequence rearrangingportion rearranges the sequence of pictures so that after I and Ppictures are encoded, a B picture is encoded. The local decoding portionis composed of an inverse quantizing portion, an inverse DCT portion, anadding portion, a frame memory, and a motion compensating portion. Themotion compensating portion performs all of a forward predictingoperation, a reverse predicting operation, and a bidirectionalpredicting operation. When the intra encoding process is performed, thesubtracting portion directly passes data, not performs the subtractingprocess. The audio encoder 2 comprises a sub-band encoding portion andan adaptively quantized bit allocating portion.

As an example, in the case of a portable disc recording and reproducingapparatus with a built-in camera, a picture photographed by the camerais input as video data. In addition, a voice collected by a microphoneis input as audio data. The video encoder 1 and the audio encoder 2convert analog signals into digital signals. According to the embodimentof the present invention, a rewritable optical disc is used as a recordmedium. Examples of such an optical disc are a magneto-optical disc anda phase-change type disc. According to the embodiment of the presentinvention, a magneto-optical disc having a relatively small diameter isused.

Output signals of the video encoder 1 and the audio encoder 2 aresupplied to a file generator 5. The file generator 5 converts outputsignals of the video encoder 1 and the audio encoder 2 into a videoelementary stream and an audio elementary stream so that they can behandled corresponding to a computer software program for synchronouslyreproducing a moving picture and a sound without need to use a dedicatedhardware portion. According to the embodiment of the present invention,for example, as such a software program, QuickTime is used. A sequenceof data (video data, audio data, and text data) that varies on time baseand that is process by QuickTime is referred to as QuickTime movie. Thefile generator 5 multiplexes encoded video data and encoded audio data.To generate a QuickTime movie file, a system controlling microcomputer 9controls the file generator 5.

QuickTime movie files generated by the file generator 5 are successivelywritten to a memory 7 through a memory controller 8. When the systemcontrolling microcomputer 9 issues a data write request for a disc tothe memory controller 8, the memory controller 8 reads a QuickTime moviefile from the memory 7. In this example, the transfer rate of theencoding process for a QuickTime movie file is lower than that for datawritten to the disc. For example, the former is half of the latter.Thus, although QuickTime movie files are successively written to thememory 7, they are intermittently read from the memory 7 under thecontrol of the system controlling microcomputer 9 in such a manner thatthe memory 7 is prevented from overflowing or underflowing.

A QuickTime movie file that is read from the memory 7 through the memorycontroller 8 is supplied to an error correction encoder/decoder 11. Theerror correction encoder/decoder 11 temporarily writes a QuickTime moviefile to a memory 10. The error correction encoder/decoder 11 performs aninterleaving process and an error correction code encoding process so asto generate redundant data. The error correction encoder/decoder 11reads the QuickTime movie file with redundant data from the memory 10.

Output data of the error correction encoder/decoder 11 is supplied to adata modulator/demodulator 13. When digital data is recorded on thedisc, the data modulator/demodulator 13 modulates the data in such amanner that a clock signal can be easily extracted so that data can berecorded on a disc free from a problem such as an inter-codeinterference. For example, RLL (1, 7) can be used.

An output signal of the data modulator/demodulator 13 is supplied to amagnetic field modulating driver 14. In addition, a signal for drivingan optical pickup 23 is output to the magnetic field modulating driver14. The magnetic field modulating driver 14 drives a magnetic field head22 corresponding to the input signal so as to apply a magnetic field toan optical disc 20. The optical pickup 23 radiates a recording laserbeam to the optical disc 20. In such a manner, data is recorded on theoptical disc 20. The optical disc 20 is rotated at CLV (Constant LinearVelocity), CAV (Constant Angular Velocity), or ZCAV (Zone CLV of whichthe disc surface area is divided into for example three areas in each ofwhich the optical disc 20 is rotated at CAV in such a manner that thevelocity of the innermost area is the highest and the velocity of theoutermost area is the lowest) so that their linear velocities nearlybecome the same.

Since data that is intermittently read from the memory controller 8 isrecorded to the optical disc 20, data is not successively recorded. Inother words, after a predetermined amount of data is recorded, therecording operation is stopped until the next record request isreceived.

When the system controlling microcomputer 9 issues a request to a drivecontrolling microcomputer 12, it issues a request to a servo circuit 15so as to control the entire disc drive. Thus, the disc drive performs arecording operation. The servo circuit 15 performs a disc radial movingservo operation, a tracking servo operation, and a focus servo operationfor the optical pickup 23. In addition, the servo circuit 15 performs aspindle servo operation for a motor 21. In association with the systemcontrolling microcomputer 9, a user operation input portion (not shown)is disposed.

Next, the structure and operation of the reproducing portion will bedescribed. When data is reproduced, a reproducing laser beam is radiatedto the optical disc 20. A detector of the optical pickup 23 converts thereflected light of the optical disc 20 into a reproduction signal. Atracking error and a focus error are detected from an output signal ofthe detector of the optical pickup 23. The servo circuit 15 controls theoptical pickup 23 so that the optical pickup 23 is placed and focused ona desired track. In addition, the servo circuit 15 controls the radialmovement of the optical pickup 23 so that it reproduces data on adesired track of the optical disc 20.

As with the recording operation, when data is reproduced, the transferrate of data reproduced from the optical disc 20 is higher than that ofa QuickTime movie file. For example, the transfer rate of datareproduced form the optical disc 20 is twice as large as the transferrate of a QuickTime movie file. Likewise, data is not successivelyreproduced from the optical disc 20. In other words, an intermittentreproducing operation is performed in such a manner that after apredetermined amount of data is reproduced, the reproducing operation isstopped until the next reproducing request is received. As with therecording operation, in the reproducing operation, when the systemcontrolling microcomputer 9 issues a request to the drive controllingmicrocomputer 12, it issues a request to the servo circuit 15 so as tocontrol the entire disc drive.

The reproduction signal that is output from the optical pickup 23 isinput to the data modulator/demodulator 13. The datamodulator/demodulator 13 demodulates the reproduction signal. Thedemodulated data is supplied to the error correction encoder/decoder 11.The error correction encoder/decoder 11 temporarily writes thereproduction data to the memory 10. The error correction encoder/decoder11 performs a deinterleaving process and an error correcting process forthe reproduction data. The error-corrected QuickTime movie file iswritten to the memory 7 through the memory controller 8.

A QuickTime movie file written to the memory 7 is output to a filedecoder 6 in synchronization with a demultiplexing timing correspondingto a request issued by the system controlling microcomputer 9. Thesystem controlling microcomputer 9 supervises the amount of data that isreproduced from the optical disc 20 and written to the memory 7 and theamount of data that is read from the memory 7 and output to the filedecoder 6 so as to successively reproduce the video signal and the audiosignal. In addition, the system controlling microcomputer 9 controls thememory controller 8 and the drive controlling microcomputer 12 so as toread data from the optical disc 20 in such a manner that the memory 7does not overflow or underflow.

The file decoder 6 decodes a QuickTime movie file into a videoelementary stream and an audio elementary stream under the control ofthe system controlling microcomputer 9. The video elementary stream issupplied to a video decoder 3. The audio elementary stream is suppliedto an audio decoder 4. The video elementary stream and the audioelementary stream are synchronously output from the file decoder 6.

The video decoder 3 and the audio decoder 4 compression-decode the videoelementary stream and the audio elementary stream and generate a videooutput signal and an audio output signal, respectively. In this example,the video signal and the audio signal have been encoded corresponding toMPEG. A video output signal is output to a display (liquid crystaldisplay or the like) through a display driver and displayed as apicture. Likewise, an audio output signal is output to a speaker throughan audio amplifier and reproduced as a sound (these structural portionsare not shown).

The video decoder 3 is composed of a buffer memory, a variable lengthcode decoding portion, an inverse DCT portion, an inverse quantizingportion, an adding portion, and a local decoding portion. The addingportion adds an output signal of the inverse quantizing portion and alocal decoded output signal. The local decoding portion is composed of apicture sequence rearranging portion, a frame memory, and a motioncompensating portion. When an intra encoding process is performed, theadding portion directly passes data, not performs the adding process.Decoded data is output from the adding portion to the picture sequencerearranging portion. The picture sequence rearranging portion rearrangesthe decoded pictures in the original order.

As was described above, since the optical disc 20 on which data isrecorded is attachable and detachable, the data recorded on the opticaldisc 20 can be reproduced by another apparatus. For example, a personalcomputer that operates with QuickTime application software may read datarecorded on the optical disc 20 and reproduce video data and audio datatherefrom. It should be noted that the present invention can be appliedto an apparatus that handles only video data or only audio data.

Next, the embodiment of the present invention will be described in moredetail. First of all, with reference to FIG. 2, QuickTime will bedescribed in brief. QuickTime is an OS expansion function forreproducing a moving picture without need to use dedicated hardware.There are various data formats for QuickTime. In other words, audiodata, video data, MDI, and so forth of up to 32 tracks can besynchronously output.

A QuickTime movie file is roughly divided into two major portions thatare a movie resource portion and a movie data portion. The movieresource portion contains time data necessary for reproducing theQuickTime movie file and information necessary for referencing realdata. The movie data portion contains real data of video data and realdata of audio data.

One QuickTime movie file can contain different types of medium data suchas a sound, a video, and a text as independent tracks that are a soundtrack, a video track, and a text track, respectively. These independenttracks are strictly controlled on time base. Each track has a medium forreferencing the compression method of the real data and the display timeperiod thereof. The medium contains the minimum sample size of the realdata in the movie data portion, the position of a chunk that is a blockof a plurality of samples, and the display duration of each sample.

FIG. 2 shows an example of a QuickTime file that handles audio data andvideo data. The largest structural portions of the QuickTime file are amovie resource portion and a movie data portion. The movie resourceportion contains the duration necessary for reproducing the file anddata necessary for referencing the real data. The movie data portioncontains real data of video data, audio data, and so forth.

Next, the movie resource portion will be described in detail. The movieresource portion contains a movie header 41 and tracks. The movie header41 contains general file information. There are a plurality of trackscorresponding to the number of types of data. FIG. 2 shows an example ofthe internal structure of a video track 50 in detail. The video track 50contains a track header 42 and a medium portion. The track header 42contains general track information. The medium portion contains a mediumheader 43, a medium handler 44, and a medium information portion. Themedium header 43 contains general medium information. The medium handler44 contains medium data handling information.

The medium information portion contains a medium handler 45, a datahandler 46, data information 47, and a sample table. The medium handler45 contains picture medium information. The data handler 46 containspicture data handling information. The data information 47 contains datainformation. The sample table contains a sample description, atime-to-sample, a sample size 48, a sample-to-chunk, a chunk offset 49,a sync sample, and so forth. The sample description contains eachsample. The time-to-sample represents the relation between a sample andthe time base. The sample size 48 represents the size of the sample. Thesample-to-chunk represents the relation between the sample and thechunk. The chunk offset 49 represents the start byte position of thechunk in the movie file. The sync sample contains synchronousinformation. Likewise, an audio track 51 has a structure (not shown)similar to that of a video track.

On the other hand, the movie data portion contains audio data encodedcorresponding to for example MPEG Audio Layer 2 and picture data encodedin the compression-encoding method corresponding to for example MPEG(Moving Picture Expert Group) method in the unit of chunks each of whichis composed of a predetermined number of samples. However, it should benoted that the present invention is not limited to such an encodingmethod. In addition, the moving data portion may contain linear datathat has not been compression-encoded.

Each track of the movie resource portion is correlated with datacontained in the movie data portion. In other words, in the exampleshown in FIG. 3, since audio data and video data are handled, the movieresource portion contains a video track and an audio track. The moviedata portion contains real data of the audio data and real data of thevideo data. When other types of data are handled, the movie resourceportion contains their tracks and the movie data portion contains realdata thereof. For example, when a text and MIDI are handled, the movieresource portion contains tracks of the text and the MIDI and the moviedata portion contains real data thereof.

Next, a method for converting compressed video data (video elementarystream) and compressed audio data (audio elementary stream) into aQuickTime file format in the case that MPEG2 is used as a decodingmethod for data that has been compression-encoded will be described.First of all, MPEG will be described. MPEG has a hierarchical structureof six layers that are a sequence layer, a GOP layer, a picture layer, aslice layer, a macro block layer, and a block layer in the order of thehighest hierarchical level. A header is placed at the beginning of eachof the six layers. For example, a sequence header is a header placed atthe beginning of the sequence layer. The sequence header contains asequence start code, a horizontal screen size, a vertical screen size,an aspect ratio, a picture rate, a bit rate, a VBV buffer size, arestriction parameter bit, a load flag of two quantized matrixes, and acontent.

According to MPEG, there are three picture types I, P, and B. In an Ipicture (Intra-coded picture), when a picture signal is encoded,information of only one picture is used. Thus, when an encoded picturesignal is decoded, information of only the I picture is used. In a Ppicture (Predictive-coded picture), as a predictive picture (a referencepicture for obtaining a difference with the current P picture), an Ipicture or another P picture that has been decoded is temporallyfollowed by the current P picture. The difference between the current Ppicture and a motion-compensated predictive picture is encoded for eachmacro block. Alternatively, the current P picture is encoded for eachmacro block without obtaining the difference of such pictures. One ofthose methods is selected whichever higher efficiency is obtained. In aB picture (Bidirectionally predictive-coded picture), as predictivepictures (reference pictures for obtaining a difference with the currentB picture), three types of reference pictures are used. The first typereference picture is an I picture or a P picture that has been decodedand that is temporally followed by the current B picture. The secondtype reference picture is an I picture or a P picture that has beendecoded and that is temporally preceded by the current B picture. Thethird type reference picture is an interpolated picture of the firsttype reference picture and the second type reference picture. Thedifference between the current B picture and each of the three typereference pictures that have been motion-compensated is encoded for eachmacro block. Alternatively, the current B picture is encoded for eachmacro block without obtaining such a difference. One of those methods isselected whichever higher efficiency is obtained.

Thus, there are a frame intra-coded macro block, a forward inter-framepredictive macro frame (a future macro block is predicted with a pastmacro block), a backward inter-frame predictive macro block (a pastmacro block is predicted with a future macro block), and a bidirectionalmacro block (a current macro block is predicted with both a future macroblock and a past macro block). All macro blocks in an I picture areintra-frame coded macro blocks. A P picture contains intra-frame codedmacro blocks and forward inter-frame predictive macro blocks. A Bpicture contains the above-described four types of macro blocks.

In MPEG, a GOP (Group Of Pictures) structure that is a group of picturesis defined so that data can be random-accessed. In MPEG, a GOP isdefined as follows. The first picture of one GOP is an I picture. Thelast picture of one GOP is an I picture or a P picture. A GOP that ispredicted with the last I or P picture of the preceding GOP ispermitted. A GOP that can be decoded without a picture of the precedingGOP is referred to as closed GOP. According to the embodiment, as astructure of a closed GOP, each GOP can be edited.

In MPEG audio (compressing method), three modes of layer 1, layer 2, andlayer 3 have been defined. In layer 1, for example 32 sub-band encodingoperation and adaptive bit allocating operation are performed. One audiodecoding unit is composed of 384 samples. One audio decoding unit is oneaudio frame of an audio bit stream. The audio decoding unit is theminimum unit of which encoded data is decoded to audio data. Likewise,the video decoding unit corresponding to one video frame has beendefined. In NTSC system, one video frame is equivalent to 1/30 seconds.Normally, the bit rate of stereo audio in layer 1 is 256 kbps. In layer2, a 32 sub-band encoding operation and an adaptive bit allocatingoperation are performed. One audio decoding unit is composed of 1152samples. Normally, the bit rate of stereo audio in layer 2 is 192 kbps.

The file generator 5 converts video data and audio data that have beencompressed corresponding to MPEG into a file structure corresponding tothe above-described QuickTime file format. FIGS. 3A and 3B show therelation among video frames, GOPs, and units of samples and chunks ofthe QuickTime file format. As was described above, one sample is theminimum unit of movie data. One chunk is a unit of which a plurality ofsamples are collected as a block.

As shown in FIG. 3A, for example 15 video frames of an original videosignal are compression-encoded corresponding to MPEG2 and thereby oneGOP is generated. 15 video frames are equivalent to 0.5 seconds. EachGOP is preferably structured as a closed GOP. A sequence header isplaced at the beginning of each GOP. The sequence header and one GOPcompose one video decoding unit. Since a sequence header is placed toeach GOP, each sample can be directly edited and decoded with QuickTime.The video encoder 1 shown in FIG. 1 outputs an MPEG video elementarystream shown in FIG. 3A.

As shown in FIG. 3B, one video decoding unit is treated as one sample ofthe QuickTime file format. Six chronologically successive samples (forexample, sample #0 to sample #5) are treated as one video chunk (forexample, chunk #0). The duration of one chuck is 3 seconds.Alternatively, six GOPs may be treated as one video chunk so that onechuck corresponds to one sample. In this case, the duration of one chuckis 3 seconds.

FIGS. 4A and 4B show the relation among audio frames encodedcorresponding to MPEG audio layer 2, GOPs, and units of samples andchunks in the QuickTime file format. In layer 2, 1152 audiosamples/channel are treated as one audio frame. As shown in FIG. 4A, instereo, 1152 audio samples×2 channels are encoded in layer 2 and treatedas one audio decoding unit. One audio decoding unit contains data of 384bytes×2 channels that have been compression-encoded. The audio decodingunit contains a header and information necessary for decoding theencoded data (allocation, scale factor, and so forth).

As shown in FIG. 4B, one audio decoding unit is treated as one sample ofthe QuickTime file format. Thus, each audio sample can be decoded withQuickTime. 125 chronological successive samples (for example, sample #0to sample #124) are treated as one audio chunk (for example, chuck #0).The duration of one chuck is 3 seconds in the case that the audiosampling frequency is 48 kHz.

In FIGS. 3A, 3B, 4A, and 4B, a video data file and an audio data fileare separately shown. The file generator 5 multiplexes a video data fileand an audio data file as one data stream and thereby generates aQuickTime movie file. In the QuickTime movie file, video chunks andaudio chucks are alternatively placed on time base. In this case, videochunks and audio chunks are placed in such a manner that a video chuckadjacents to an audio chunk corresponding thereto. As was describedabove, the duration of video data of one video chunk is equal to theduration of audio data of one audio chunk (for example, 3 seconds).

As another example of the audio compression-encoding method, ATRAC(Adaptive Transform Acoustic Coding) used for Mini Disc may be used. InATRAC, audio data of 16 bits sampled at 44.1 kHz is processed. Theminimum data unit processed in ATRACK is one sound unit. In stereo, onesound unit is composed of 512 samples×16 bits×2 channels.

When ATRACK is used as an audio compression-encoding method, as shown inFIG. 5A, one sound unit is compressed to an audio decoding unit of 212bytes×2 channels. As shown in FIG. 5B, one audio decoding unit istreated as one sample in the QuickTime file format. 64 samples aretreated as one chunk in the QuickTime file format.

According to the present invention, the audio data may be recorded on anon-compression basis. The non-compression method is referred to aslinear PCM. Likewise, in linear PCM, 512 audio samples are treated asone audio decoding unit. One audio decoding unit is treated as onesample in the QuickTime file format.

FIGS. 6A, 6B, 6C, and 6D show a QuickTime file format for video data inthe case that video data and audio data are multiplexed. As shown inFIG. 6A, the period of a video frame is t0 seconds and the number offrames of one GOP is f0. When original video data is encodedcorresponding to MPEG2, an MPEG video elementary stream shown in FIG. 6Bis formed. As was described above, a sequence header (SH) is placed toeach GOP.

As shown in FIG. 6C, one GOP with a sequence header is treated as onesample in the QuickTime file format. The length of one sample isreferred to as sample size. With a plurality of samples (for example,six samples), one chunk is composed in the QuickTime file format. Asshown in FIG. 6D, video chunks and audio chunks are alternately placedon time base and thereby multiplexed. As a result, a QuickTime moviefile is formed. The beginning of each video chunk of the QuickTime moviefile is referred to as video chunk offset. The video chunk offset isrepresented by the number of bytes from the beginning of the file to thebeginning of the video chunk.

FIGS. 7A, 7B, 7C, and 7D show a QuickTime file format of audio data inthe case that video data and audio data are multiplexed. As shown inFIG. 7A, an original audio signal is digitized. One audio frame containsf0 audio samples×n channels. When the original audio data iscompression-encoded corresponding to MPEG audio, an MPEG audioelementary stream shown in FIG. 7B is formed.

As shown in FIG. 7C, for example one audio decoding unit is treated asone sample of the QuickTime file format. The size of one sample isreferred to as sample size. A plurality of samples (for example, 125samples) composes one chunk of the QuickTime file format. As shown inFIG. 7D, video chunks and audio chunks are alternately placed andthereby multiplexed. As a result, a QuickTime movie file is formed. Thebeginning of each audio chunk of a QuickTime movie file is referred toas audio chunk offset. The audio chunk offset is represented by thenumber of bytes from the beginning of the file to the beginning of theaudio chunk. The duration of each video chunk is the same as theduration of each audio chunk. For example, the duration is 3 seconds.

The sample size of a video sample, the sample size of an audio sample,the offset value of a video chunk, and the offset value of an audiochunk are contained in the resource of a QuickTime movie file. With theresource, each sample of each chunk can be designated and edited (in theencoding unit).

Next, as mentioned above, a recording method for recording a QuickTimemovie file of which video chunks and audio chunks have been multiplexed(interleaved) to the optical disc 20 will be described. As describedabove, one QuickTime file format is roughly divided into two majorportions that are a movie resource portion and a movie data portion.When a QuickTime movie file is recorded to the optical disc 20, as shownin FIG. 8, the movie resource is matched with the successive recordlength. In addition, each chunk (video chunk or audio chunk) of themovie data (real data) is matched with the successive record length ofthe disc. The successive record length means the length of which datacan be written to successive addresses without a jumping operation ofthe optical pickup 23.

FIG. 9 shows another example of which a QuickTime movie file is recordedon the optical disc 20. As described above, when video chunks and audiochunks have been multiplexed, a pair of an audio chunk and an adjacentvideo that corresponds thereto is matched with the successive recordlength.

As shown in FIGS. 8 and 9, the position of the successive record lengthis physically not continuous. Thus, after the movie resource isreproduced, when the first audio chunk and video chunk are reproduced(namely, data of two successive record lengths is reproduced), a trackjump takes place. However, as was described above, since the datatransfer rate of write/read operation is higher (for example, two timeshigher) than the data transfer rate of a QuickTime movie file, even ifdata is intermittently read, successive QuickTime movie files can bereproduced.

Thus, the transfer rate of a QuickTime movie file, the read rate of datafrom the optical disc, the duration of the successive record length, andthe seek time of the disc drive (the seek time is the duration necessaryfor a track jump from one track to another track) mutually relate. Thus,the duration of video data and audio data recorded in the successiverecord length can be selected in various manners from other than 3seconds. It is preferred that in the duration for video frames of videodata recorded in the successive record length, an integer number ofaudio samples are placed.

When the duration of video data and the duration of audio data recordedin the successive record length are not fixed, the movie resource of theQuickTime movie file contains the successive record length. For example,the movie resource contains the number of frames in one video chunk andthe number of samples in one audio sample.

In the above description, the present invention is applied to a portabledisc recording and reproducing apparatus with built-in camera. It shouldbe noted that the present invention can be applied to other apparatuses.For example, the present invention can be applied to a digital stillcamera and a digital audio recorder and player.

In addition, according to the present invention, part or all of thehardware structure shown in the block diagram of FIG. 1 can beaccomplished by software. The software is stored on a computer readablerecord medium such as a CD-ROM.

In the above embodiment, the present invention was applied to QuickTime.However, it should be noted that the present invention can be applied tocomputer software that allows a plurality of sequences of data to besynchronously reproduced without need to use dedicated hardware.

According to the present invention, since at least one GOP in MPEGcompression video corresponds to the first data unit (sample) of a filesuch as QuickTime, each of plurality of types of data can be accessedand edited. In addition, when data with a file structure is recorded onan optical disc, since the successive record length corresponds to thesecond data unit (for example, a chunk of QuickTime), the accessibilityand editing efficiency can be improved.

Although the present invention has been shown and described with respectto a best mode embodiment thereof, it should be understood by thoseskilled in the art that the foregoing and various other changes,omissions, and additions in the form and detail thereof may be madetherein without departing from the spirit and scope of the presentinvention.

1. A recording apparatus for recording data to a record medium,comprising: video encoding means for encoding video data in a groupstructure of a plurality of frames by performing a compression-encodingprocess; video data output means for outputting encoded video data bysaid video encoding means; audio data output means for outputtingcompression-encoded or non-compressed audio data; management datagenerating means for generating management data which manages saidencoded video data and said audio data; transforming means fortransforming the data structure of encoded video data that is outputfrom said video data output means, audio data that is output from saidaudio data output means, and the management data into a file structure;and recording means for recording said transformed encoded video data,the audio data, and the management data to a record medium, wherein thefile structure contains a first video data unit which corresponds to apredetermined number of frames of said encoded video data outputted fromsaid video output means, a first audio data unit which corresponds to apredetermined number of sound samples of said audio data, a second videodata unit which comprises a plurality of said first video data units,and a second audio data unit which comprises a plurality of said firstaudio data units, wherein said second video data unit and said secondaudio data unit are recorded on a successive location of said recordmedium respectively; and wherein said management data includes videotrack data and audio track data of independent data structure, andwherein the video track data contains a size quantity of the first videodata unit and a start position of the second video data unit and theaudio track data contains a size quantity of said first audio data unitand a start position of said second audio data unit, respectively.
 2. Arecording apparatus for recording video data to a rewritable opticaldisc, comprising: video encoding means for encoding video data in agroup structure of a plurality of frames by performing acompression-encoding process; video data output means for outputtingencoded video data by said video encoding means; audio data output meansfor outputting compression-encoded or non-compressed audio data;management data generating means for generating management data whichmanages said encoded video data and said audio data; transforming meansfor transforming the data structure of encoded video data that is outputfrom said video data output means, audio data that is output from saidaudio output means, and the management data into a file structure; andrecording means for recording said transformed encoded video data, theaudio data, and the management data to a record medium, wherein the filestructure contains a first video data unit which corresponds to apredetermined number of frames of said encoded video data outputted fromsaid video output means, a first audio data unit which corresponds to apredetermined number of sound samples of said audio data, a second videodata unit which comprises a plurality of said first video data units,and a second audio data unit which comprises a plurality of said firstaudio data units, wherein said second video data unit and said secondaudio data unit are recorded on a successive location of said recordmedium respectively; and wherein said management data includes videotrack data and audio track data of independent data structure, andwherein the video track data contains a size quantity of the first videodata unit and a start position of the second video data unit and theaudio track data contains a size quantity of said first audio data unitand a start position of said second audio data unit, respectively. 3.The recording apparatus as set forth in claim 1, wherein thecompression-encoding process is MPEG, wherein the group structure is GOPstructure, and wherein data of which a sequence header is added to eachGOP is matched with the first data unit.
 4. The recording apparatus asset forth in claim 1, wherein the duration of the encoded video data ofthe second video data unit is the same as the duration of the encodedaudio data of the second audio data unit in the transformed data.
 5. Therecording apparatus as set forth in claim 1, wherein the encoded videodata of the second video data unit and the encoded audio data of thesecond audio data unit are alternately placed in the file structure,each of the encoded video data of the second video data unit and theencoded audio data of the second audio data unit being matched with asuccessive record length on the record medium.
 6. A recording method forrecording video data to a record medium, comprising the steps of:encoding video data in a group structure of a plurality of frames byperforming a compression-encoding process; outputting encoded video databy said encoding step; outputting compression-encoded or non-compressedaudio data; generating management data which manages said encoded videodata and said audio data; transforming the data structure of encodedvideo data that is output from said video data output step, audio datathat is output from said audio output step, and the management data intoa file structure; and recording said transformed encoded video data, theaudio data, and the management data to a record medium, wherein the filestructure contains a first video data unit which corresponds to apredetermined number of frames of said encoded video data outputted fromsaid video output step, a first audio data unit which corresponds to apredetermined number of sound samples of said audio data, a second videodata unit which comprises a plurality of said first video data units,and a second audio data unit which comprises a plurality of said firstaudio data units, wherein said second video data unit and said secondaudio data unit are recorded on a successive location of said recordmedium respectively; and wherein said management data includes videotrack data and audio track data of independent data structure, andwherein the video track data contains a size quantity of the first videodata unit and a start position of the second video data unit and theaudio track data contains a size quantity of said first audio data unitand a start position of said second audio data unit respectively.
 7. Arecording method for recording video data to a rewritable optical disc,comprising the steps of: encoding video data in a group structure of aplurality of frames by performing a compression-encoding process;outputting encoded video data by said encoding step; outputtingcompression encoded or non-compressed audio data; generating managementdata which manages said encoded video data and said audio data;transforming the data structure of encoded video data that is outputfrom said video data output step, audio data that is output from saidaudio output step, and the management data into a file structure; andrecording said transformed encoded video data, the audio data, and themanagement data to a record medium, wherein the file structure containsa first video data unit which corresponds to a predetermined number offrames of said encoded video data outputted from said video output step,a first audio data unit which corresponds to a predetermined number ofsound samples of said audio data, a second video data unit whichcomprises a plurality of said first video data units, and a second audiodata unit which comprises a plurality of said first audio data units,wherein said second video data unit and said second audio data unit arerecorded on a successive location of said record medium respectively;and wherein said management data includes video track data and audiotrack data of independent data structure, and wherein the video trackdata contains a size quantity of the first video data data and a startposition of the second video data unit and the audio track data containsa size quantity of said first audio data unit and a start position ofsaid second audio data unit, respectively.
 8. A recording method forrecording video data and audio data to a record medium, comprising thesteps of: encoding video data and audio data in a group structure of aplurality of frames by performing a compression-encoding process;outputting encoded video data by said encoding step; outputtingcompression-encoded or non-compressed audio data; generating managementdata which manages said encoded video data and said audio data;transforming the data structure of encoded video data that is outputfrom said video data output step, audio data that is output from saidaudio output step, and the management data into a file structure; andrecording said transformed encoded video data, the audio data, and themanagement data to a record medium, wherein the file structure containsa first video data unit which corresponds to a predetermined number offrames of said encoded video data outputted from said video output step,a first audio data unit which corresponds to a predetermined number ofsound samples of said audio data, a second video data unit whichcomprises a plurality of said first video data units, and a second audiodata unit which comprises a plurality of said first audio data units,wherein said second video data unit and said second audio data unit arerecorded on a successive location of said record medium respectively;and wherein said management data includes video track data and audiotrack data of independent data structure, and wherein the video trackdata contains a size quantity of the first video data unit and a startposition of the second video data unit and the audio track data containsa size quantity of said first audio data unit and a start position ofsaid second audio data unit, respectively.
 9. A computer readable mediumembodying a program causing a computer to perform the steps of: encodingvideo data in a group structure of a plurality of frames by performing acompression-encoding process; outputting encoded video data by saidencoding step; outputting compression encoded or non-compressed audiodata; generating management data which manages said encoded video dataand said audio data; transforming the data structure of encoded videodata that is output from said video data output step, audio data that isoutput from said audio output step, and the management data into a filestructure; and recording said transformed encoded video data, the audiodata, and the management data to a record medium, wherein the filestructure contains a first video data unit which corresponds to apredetermined number of frames of said encoded video data outputted fromsaid video output step, a first audio data unit which corresponds to apredetermined number of sound samples of said audio data, a second videodata unit which comprises a plurality of said first video data units,and a second audio data unit which comprises a plurality of said firstaudio data units, wherein said second video data unit and said secondaudio data unit are recorded on a successive location of said recordmedium respectively; and wherein said management data includes videotrack data and audio track data of independent data structure, andwherein the video track data contains a size quantity of the first videodata unit and a start position of the second video data unit and theaudio track data contains a size quantity of said first audio data unitand a start position of said second audio data unit, respectively.
 10. Acomputer readable medium embodying a program causing a computer toperform the steps of: encoding video data in a group structure of aplurality of frames by performing a compression-encoding process;outputting encoded video data by said encoding step; outputtingcompression encoded or non-compressed audio data; generating managementdata which manages said encoded video data and said audio data;transforming the data structure of encoded video data that is outputfrom said video data output step, audio data that is output from saidaudio output step, and the management data into a file structure; andrecording said transformed encoded video data, the audio data, and themanagement data to a record medium, wherein the file structure containsa first video data unit which corresponds to a predetermined number offrames of said encoded video data outputted from said video output step,a first audio data unit which corresponds to a predetermined number ofsound samples of said audio data, a second video data unit whichcomprises a plurality of said first video data units, and a second audiodata unit which comprises a plurality of said first audio data units,wherein said second video data unit and said second audio data unit arerecorded on a successive location of said record medium respectively;and wherein said management data includes video track data and audiotrack data of independent data structure, and wherein the video trackdata contains a size quantity of the first video data unit and a startposition of the second video data unit and the audio track data containsa size quantity of said first audio data unit and a start position ofsaid second audio data unit, respectively.
 11. A computer readablemedium embodying a program causing a computer to perform the steps of:encoding video data and audio data in a group structure of a pluralityof frames by performing a compression-encoding process; outputtingencoded video data by said encoding step; outputting compression encodedor non-compressed audio data; generating management data which managessaid encoded video data and said audio data; transforming the datastructure of encoded video data that is output from said video dataoutput step, audio data that is output from said audio output step, andthe management data into a file structure; and recording saidtransformed encoded video data, the audio data, and the management datato a record medium, wherein the file structure contains a first videodata unit which corresponds to a predetermined number of frames of saidencoded video data outputted from said video output step, a first audiodata unit which corresponds to a predetermined number of sound samplesof said audio data, a second video data unit which comprises a pluralityof said first video data units, and a second audio data unit whichcomprises a plurality of said first audio data units, wherein saidsecond video data unit and said second audio data unit are recorded on asuccessive location of said record medium respectively; and wherein saidmanagement data includes video track data and audio track data ofindependent data structure, and wherein the video track data contains asize quantity of the first video data unit and a start position of thesecond video data unit and the audio track data contains a size quantityof said first audio data unit and a start position of said second audiodata unit, respectively.
 12. A computer readable medium embodying aprogram causing a computer to perform the steps of: encoding video dataand audio data in a group structure of a plurality of frames byperforming a compression-encoding process; outputting encoded video databy said encoding step; outputting compression encoded or non-compressedaudio data; generating management data which manages said encoded videodata and said audio data; transforming the data structure of encodedvideo data that is output from said video data output step, audio datathat is output from said audio output step, and the management data intoa file structure; and recording said transformed encoded video data, theaudio data, and the management data to a record medium, wherein the filestructure contains a first video data unit which corresponds to apredetermined number of frames of said encoded video data outputted fromsaid video output step, a first audio data unit which corresponds to apredetermined number of sound samples of said audio data, a second videodata unit which comprises a plurality of said first video data units,and a second audio data unit which comprises a plurality of said firstaudio data units, wherein said second video data unit and said secondaudio data unit are recorded on a successive location of said recordmedium respectively; and wherein said management data includes videotrack data and audio track data of independent data structure, andwherein the video track data contains a size quantity of the first videodata unit and a start position of the second video data unit and theaudio track data contains a size quantity of said first audio data unitand a start position of said second audio data unit, respectively. 13.The recording apparatus as set forth in claim 1, wherein said firstvideo data unit and said first audio data unit correspond to theencoding unit which can be decoded respectively.
 14. The recordingapparatus as set forth in claim 1, wherein said transforming meanstransforms the data structure of said encoded video data and said audiodata into said file structure which contains said first video unit, saidfirst audio data unit, said second video data unit, said second audiodata unit, and a resource data which includes at least the size of saidfirst video data unit and said fist audio data unit; and said recordingmeans records said resource data to said record medium.
 15. Therecording apparatus as set forth in claim 1 wherein said recording meansrecords said transformed encoded video data and said audio data to saidrecord medium so that said second video data unit and said second audiodata unit are recorded on a successive record length of said recordmedium respectively.
 16. The recording apparatus of claim 1, whereinsaid recording means records said transformed encoded video data andsaid audio data to said record medium so that said second video unit andsaid second audio unit are placed in such a manner that said secondvideo data unit is adjacent to said second audio data unit correspondingthereto.
 17. The recording apparatus according to claim 1, wherein thevideo track data contains information representing a relationshipbetween the first video data unit and a time base and the audio trackdata contains information representing a relationship between the firstaudio data unit and a time base.
 18. A recording apparatus for recordingdata to a record medium, comprising: a video encoder for encoding videodata in a group structure of a plurality of frames by performing acompression-encoding process; a video output for outputting encodedvideo data by said video encoder; an audio output for outputtingcompression-encoded or non-compressed audio data; a data generator forgenerating management data which manages said encoded video data andsaid audio data; a transformer for transforming the data structure ofencoded video data that is output from said video output, audio datathat is output from said audio output, and the management data into afile structure; and a recorder for recording said transformed encodedvideo data, the audio data, and the management data to a record medium,wherein the file structure contains a first video data unit whichcorresponds to a predetermined number of frames of said encoded videodata outputted from said video output, a first audio data unit whichcorresponds to a predetermined number of sound samples of said audiodata, a second video data unit which comprises a plurality of said firstvideo data units, and a second audio data unit which comprises aplurality of said first audio data units, wherein said second video dataunit and said second audio data unit are recorded on a successivelocation of said record medium respectively; and wherein said managementdata includes video track data and audio track data of independent datastructure, and wherein the video track data contains a size quantity ofthe first video data unit and a start position of the second video dataunit and the audio track data contains a size quantity of said firstaudio data unit and a start position of said second audio data unit,respectively.